import packages

load dataset

data cleaning

simple approach (fill NAs)

KNNImputer (fill NAs)

MissForest (fill NAs)

KNN (not scaled)

KNN (scaled)

split the dataset (too much computation cost to do CV)

random forest

interaction b/t longitude & latitude

adaboost

gradient boost

MLP neural network